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Abstract 



Most approximation algorithms for #P-complete problems (e.g., evaluating the permanent of 
a matrix or the volume of a polytope) work by reduction to the problem of approximate sampling 
from a distribution 7r over a large set S. This problem is solved using the Markov chain Monte 
Carlo method: a sparse, reversible Markov chain P on S with stationary distribution 7r is run 
to near equilibrium. The running time of this random walk algorithm, the so-called mixing 
time of P, is 0(S~ 1 logl/7r») as shown by Aldous, where S is the spectral gap of P and 7r* is 
the minimum value of n. A natural question is whether a speedup of this classical method to 
0(V (5 _1 log 1/7T*), the diameter of the graph underlying P, is possible using quantum walks. 

We provide evidence for this possibility using quantum walks that decohere under repeated 
randomized measurements. We show: (a) decoherent quantum walks always mix, just like their 
classical counterparts, (b) the mixing time is a robust quantity, essentially invariant under any 
smooth form of decoherence, and (c) the mixing time of the decoherent quantum walk on a 
periodic lattice Zf t is 0(nd\ogd), which is indeed 0(V<5 _1 logl/V*) and is asymptotically no 
worse than the diameter of (the obvious lower bound) up to at most a logarithmic factor. 

1 Introduction 

1.1 Markov chain Monte Carlo and quantum walks 

A rich theory has been developed for computing approximate solutions to problems in combinatorial 
enumeration and statistical physics which are ^P-complete and therefore unlikely to have efficiently 
computable exact solutions. Among the highlights are randomized polynomial-time approximation 
schemes for evaluating the permanent of a nonnegative matrix (Jerrum, Sinclair, and Vigoda [20J), 
the volume of a convex polytope (Dyer, Frieze, and Kannan [E]), and the partition functions 
of monomer-dimer and ferromagnetic Ising systems (Jerrum and Sinclair [18^ I19j). At the heart 
of these algorithms (and consuming most of the running time) is a subroutine for approximate 
sampling from a particular distribution ir over a large set S. This problem is solved using the Markov 
chain Monte Carlo (MCMC) method: a sparse, reversible Markov chain (stochastic matrix) P on 
S with stationary distribution tt is run from an arbitrary initial state to a random state distributed 
very close to tt. The time required to guarantee this convergence, or mixing, is the so-called mixing 
time r m i x . Bounding r m ; x is typically the chief technical hurdle in analyzing the running time of 
the overall algorithm. It is reduced to the problem of estimating the spectral gap 5 of P by Aldous' 
inequality [3] 
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where 7r* is the minimum value of tt. Since 8 is generally no easier to estimate directly than T m i x , 
we often bound the conductance $ (a geometric parameter of the chain) and appeal to the Cheeger 
inequality (see e.g., Alon [5]): 

< 5 < 2$ (2) 

Further inequalities (e.g., involving congestion of multicommodity flows through the chain) are 
often invoked on top of these. 

Notice that Aldous' inequality ([1]) is tight with respect to the spectral gap. However, it is also 
somewhat unsatisfactory in that although the diameter of the graph underlying P (the obvious lower 
bound for sampling from its stationary distribution) scales like 0(V <5 _1 log 1/tt*), the dependence is 

in Aldous' inequality due to the Gaussian-like spreading behavior of random walks. In MCMC 
applications, the additional factor of V 5~ l can mean the difference between, say, an n 3 and n 5 
algorithm! Removing it would imply a considerable improvement in both the known upper bounds 
(since existing estimates of 5 could be used in conjunction with a sharper inequality than Aldous') 
and the true upper bounds (since Aldous' inequality is tight and thus a sharper inequality could 
only come from a faster approximate sampling method). Thus, a natural question is whether there 
is a way to modify the standard MCMC method to obtain a speedup to 0(s/ <5 -1 log 1/tt*). This 
seems unlikely using classical randomized methods: Chen, Lovasz, and Pak [9] have shown that 
we can sometimes speed up mixing by lifting a Markov chain, but this requires both knowledge of 
the chain's global structure and its use in solving an NP-hard flow problem to find low-congestion 
paths along which to "route" probability mass efficiently However, although lifting the chain seems 
unlikely to be practical, an idea that might work is quantizing the chain. 

Why might a quantized Markov chain, or quantum walk, help us reach 7r more quickly? Two 
reasons are: (1) a quantum walk is as simple to realize as its classical counterpart (i.e., it is 
computable locally and online, unlike a classical lifting of the chain), and (2) there is empirical 
(and some theoretical) evidence that quantum walks tend to propagate and "spread" probability 
mass across S in time on the order of the diameter on precisely the same low-dimensional graphs 
that trip up their classical counterparts. Based on these observations, the possibility of obtaining 
a quantum speedup for the mixing problem has been pursued by Nayak et al. (HUE], Aharonov et 
al. PP, Moore and Russell [29], Gerhardt and Watrous [15], and Richter [51] . 

We remark that a quantum speedup theorem of the sort we seek has already been proven 
for the hitting problem, in which we search (rather than sample from) the states of a Markov 
chain: Szegedy [34] proved a quadratic quantum speedup for the hitting time of any symmetric 
Markov chain, generalizing considerably the celebrated search algorithm of Grover [16] and implying 
quantum speedups for structured search problems such as element distinctness (Ambainis [6]), 
triangle finding (Magniez, Santha, and Szegedy [28]), matrix product verification (Burhman and 
Spalek [8]), and group commutativity testing (Magniez and Nayak [26]). It is this success which 
inspires us to investigate the possibility of a quantum speedup for the mixing problem with the 
goal of transferring the speedup to MCMC algorithms. 

1.2 Our contributions 

We present evidence of a possible quantum MCMC speedup to 0(V 5 _1 log 1/7T*) using quantum 
walks that decohere under repeated randomized measurements. Decoherence (in small amounts) 
was first identified as a way to improve spreading and mixing properties in numerical experiments 
performed by Kendon and Tregenna [23] and in analytical estimates by Fedichkin et al. [14^ 132 1 [33]. 
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On the other hand, high rates of decoherence in quantum walks have been shown to degrade mixing 
properties substantially by the quantum Zeno effect ( Alagic and Russell [3] ) . For an excellent survey 
of these and other aspects of decoherent quantum walks, see Kendon }22j. 

Our technical contributions are as follows: First, we show that for any symmetric Markov chain 
P, we can generate an arbitrarily good approximation to the uniform stationary distribution tt 
of P by subjecting the continuous-time quantum walk U c t(P) = exp(iP) to reasonably "smooth" 
decoherence. Thus, decoherent quantum walks (which are non-unitary) offer a way of circumventing 
an obstacle first identified by Aharonov et al. [I], who observed that unitary quantum walks often 
converge (in the time- aver aged sense) to highly non- uniform distributions. 

Second, we show that the optimal mixing time of a decoherent quantum walk is a robust 
quantity, in that it remains essentially invariant under any sufficiently smooth form of decoherence. 
In particular, decoherent quantum walks undergoing repeated Cesaro-averaged [H I29] [T5] and 
Bernoulli/Poisson- averaged [23[ [3] measurements are nearly equivalent in mixing efficiency. The 
proof applies more generally to a game involving time-dependent Markov chains (not necessarily 
describing quantum phenomena) and may be of independent interest. 

Third, we prove a theorem on threshold mixing of quantum walks on (Cartesian) graph powers 
in order to show that the decoherent continuous-time quantum walk U c t{P[G)) on a periodic lattice 
G = (where P(G) denotes the standard Markov chain on G) can be used to generate a good 
approximation to the uniform stationary distribution tt of P in time 0(ndlogd). This upper 
bound is asymptotically no worse than the diameter of (the obvious lower bound) up to at most 
a logarithmic factor and is 0(V <5 _1 log 1/tt*) for both high-dimensional and low-dimensional lattices 
(unlike its classical counterpart). For d = 1, this proves a conjecture of Kendon and Tregenna [25] 
based on numerical experiments For d > 1, it extends the results of Fedichkin et al. [14[ [321 [33] 
by confirming O(n) and 0(d log d) scaling (suggested by their analytical estimates and numerical 
experiments in regimes of both high and low decoherence) of the fastest-mixing walk, which they 
conjectured to be decoherent rather than unitary. We briefly discuss the prospects for extending 
this result to the discrete-time Grover walk U dt {P{1,i)) [1711551125] . 

Previously, mixing speedups had been proven only for the unitary quantum walks of Nayak et 
al. [301 IT] and Aharonov et al. [1] (on the cycle) and of Moore and Russell [29] (on the hypercube). 
Thus, our work shows that introducing a small amount of decoherence to a quantum walk can 
simultaneously force convergence to the uniform distribution whilepreserving a quantum mixing 
speedup, an advantageous combination for algorithmic applications □ 

2 Preliminaries 

2.1 Markov chains 

Let P be a Markov chain (stochastic matrix) on the set S {\S\ = N) which is irreducible (strongly 
connected); then it has a unique distribution tt which is stationary (i.e., satisfies Pir = tt). Moreover, 
tt is strictly positive: 7r* := min x tt x satisfies 7r* > 0. In particular, if P is symmetric, then tt is the 

In fact this is not quite true: Kendon and Tregenna [23] conjectured this for the discrete-time Hadamard walk. 
We prove it for both walks. 

2 Alagic and Russell [3] exhibit a decoherent quantum walk on the hypercube which converges to the uniform 
distribution and preserves the quantum mixing speedup proven by Moore and Russell [25], but in contrast to our 
work, it is their single (carefully chosen) final measurement that forces uniform convergence, not the decoherence 
itself (which only adds "noise" that if small enough does not destroy the speedup). 
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uniform distribution u (the iV-dimensional column vector with each component equal to 1/N). If 
G is an undirected graph, we denote the standard ("simple random walk") Markov chain on G by 
P{G). 

A Markov chain P which is both irreducible and aperiodic (non-bipartite) is by definition ergodic 
and satisfies 

P t -> vrl 1 " = [vr • • • 7r] as t -> oo (3) 

where it is the A^-dimensional row vector with each component equal to one. We can thus define 
the (threshold) mixing time 

r mix := min{T : i||P* - ttI+Hi < ^ V< > T} (4) 

where || • ||i is the matrix 1-norm. The mixing is perfect if P* = vrl+. Let 5 := 1 - ||P|^±|| 2 > be 
the spectral gap of P. We say that P is reversible if the matrix DPD~ l is symmetric, where I? is 
the diagonal matrix D(x,x) = v / 7f^. A precise statement of Aldous' inequality (JTJ) is: 

Theorem 2.1 (Aldous |4j) Let P be a reversible, ergodic Markov chain with stationary distribu- 
tion 7r and spectral gap 5. Then its mixing time satisfies d^ 1 < T m ; x < (1 + 5 log l/V*). 

We will also use the maximum pairwise column distance d{P) := max. XjX i ^\\P(-,x) — P(-,a/)||i 
to estimate the mixing time. It is related to the matrix 1-norm distance by the inequality: 

±\\P-Trl%<d(P)<\\P-irl% (5) 

The following propositions (see [31] for proofs) can be used to estimate the mixing time of P 
given a common lower bound on most of the entries in each column. 

Proposition 2.2 If d(P) < a, then r m i x < [log^ 2e] . 

Proposition 2.3 If at least (3N entries in each column of P are bounded below by j/N, where 
P>\ and 7 > 0, then d(P) < 1 - 7(1 - 2(1 - /?)). 



2.2 Quantum walks 

Heretofore, a quantum walk is a pair (U,ujt) where the transition rule U is a unitary operator 
acting on a finite-dimensional Hilbert space and the measurement rule lot is a T-parametrized 
family of probability density functions on [0, 00) characterizing the (random) time at which a total 
measurement is performed on the Hilbert spaceH 

The unitary transition rule determines the orbit of a pure quantum state, or wavefunction (I2- 
normalized complex vector), just as a Markov chain (or stochastic transition rule) determines the 
orbit of a classical distribution (^-normalized nonnegative vector). Let P be the Markov chain used 
in an MCMC algorithm; in particular, P is reversible. Two natural quantizations of P are: (a) the 
S xS unitary continuous-time walk given by U c t(P) = exp(iDPZ) -1 ) [13^12]. where H = DPD~ l is 
the time-independent Hamiltonian, and (b) the S 2 x S 2 unitary discrete-time Grover/Szegedy walk 

' ! Though repeatedly measured quantum walks generate mixed states, our analysis is simplest without the intro- 
duction of density matrices. We refer the reader lacking sufficient background in quantum computing to the excellent 
text of Kitaev, Shen, and Vyalyi [24] , 
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given by Udt(P) = (RS) 2 [33], where S is the involution ^2 xy€S \x,y) > \y,x), R is the reflection 

T,xes \ x )( x \ ® ( 2 IPx)(Px| - I), and |p x ) is the vector J2 y eS V P (y^ x )\v)- The quantization U ct (P) 
was used by Childs et al. [11] to solve in polynomial time a natural oracle problem for which no 
classical polynomial-time algorithm exists. The quantization Udt(P) was used by Szegedy [33] to 
prove a quadratic quantum speedup for the hitting time of any symmetric Markov chain. 

Measurement collapses the wavefunction to a classical distribution according to the map \<p) \— > 
Ylx l x )K x l^)l 2 - For a quantum walk (U,u>t), the Markov chain generated by the quantum walk is 
given by Pr(y,x) := E t ^ LUT [\(y\U t \x)\ 2 ], where E denotes expected value. We say that the quantum 
walk threshold-mixes if the Markov chain it generates mixes in time 0(1). Examples of measurement 
rules from the literature include the point distribution 5x(t) := 5(t—T) where 5 is the delta function 
[301 [7], the uniform distribution JIt '■= ffX[o,T] an d its discrete-time counterpart vt '■= fX[o..T-i] 
where \ is the characteristic function [JJ, the exponential distribution p,T(t) '■= ^exp(— t/T), 
and the geometric distribution ur(t) := y(l — y)*. The exponential and geometric distributions 
are memoryless and describe the interarrival time between measurements in a Poisson process with 
measurements occurring at rate A = 1 /T and a Bernoulli process with measurements occurring with 
probability p = 1/T at each timestep, respectively. These processes coincide with the decoherence 
models of Alagic and Russell [3j and Kendon and Tregenna [23], respectively. 

Nayak et al. [30j[7] and Aharonov et al. p] showed that the so-called Hadamard walks (Unad, 3t) 
and {Utfad, Pt) ° n the cycle Z n threshold-mix quadratically faster than the classical random walk, 
although their definitions of threshold mixing are slightly different than ours. Moore and Russell 
[29] showed that the continuous-time walk (U c t(P(G)),5T) and the Grover walk (Udt(P(G)), St) on 
the hypercube G = TL\ mrx perfectly and almost perfectly, respectively, in time T = 0(d). They 
also showed that the continuous-time quantum walk on the hypercube with measurement £it does 
not mix to the uniform stationary distribution ir of P^^) m the limit T — > oo. Gerhardt and 
Watrous [15] showed the same for a continuous-time quantum walk on the symmetric group with 
measurement JIt- See the survey by Kempe [21] for further results on quantum walks. 

We remark that there is a nice way to use the quantization Udt(P) to solve the mixing problem 
in time OfT/V^ 71 "*) which, although prohibitively costly in it*, exhibits the desired dependence on 
5. Consider the stationary eigenstate \tt) := J2 x es V^x\ x )\p%) °^ Udt(P)- It is clear that we can 
retrieve a good approximation to the classical distribution tt by generating and then measuring a 
good approximation to |7r)o Magniez et al. [27] observe that 0(1/ V5) steps of phase estimation 
on Udt(P) enable us to reflect about \tt). By alternating this reflection with a reflection about 
\z) := |z)|p 2 ) where z is the initial walk state (in particular, (z\tt) > ^/^^^), we can generate \tt) 
from \z) in time 0(l/y/5n*). In fact, this algorithm is described by Magniez et al. [27] as a hitting 
algorithm (i.e., generate the unknown state \z) from the fixed initial state | vr) ) ; the idea of running 
a quantum hitting algorithm in reverse as a mixing algorithm was suggested by Childs [10] . 

3 Mixing properties of decoherent quantum walks 

3.1 Two types of convergence 

Let Pt be the Markov chain generated by a quantum walk (U,ujt)- Then repeating the quantum 
walk T 1 times in succession generates the Markov chain (Pt) T ■ The following lemma (a variant 

4 Similarly, we can retrieve a good approximation to 7r by generating and then measuring a good approximation 
to the ground state \tt) = X^es V^\ x ) °f ^ ne Hamiltonian H = DPD~ 1 [2]. 
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of Theorem 3.4 in Aharonov et al. pQ) and theorem describe the asymptotic behavior of Pt and 
(Pt) T m the limits T — > oo and T' — > oo, respectively. For concreteness we will take U = U c t in this 
subsection and the next; it is a simple exercise to extend the results to discrete-time walk variants. 
Although stated explicitly for quantum walks, the results apply to any time-independent quantum 
dynamics on a finite-dimensional Hilbert space subjected to random destructive measurements. 

Lemma 3.1 (The limit T — ► oo) Let P be a symmetric Markov chain and lot be a family of 
distributions satisfying Et<-uj T {e lSt ] — > as T — > oo for any 6^0. In the limit T — ► oo, the Markov 
chain Pt generated by the quantum walk {U c t(P),uJT) approaches the Markov chain IT with entries 

n(y,x):=^|^(y|0 fc )(^|x)| 2 (6) 

where \<j>k)} is the spectrum of P and {Cj} is the partition of these indices k obtained by 
grouping together the k with identical A&. 

Proof: Decomposing the quantum walk along spectral components gives us: 

P T (y,x) = E t ^ T [\J2(y\<t>k}(<f>k\x)e akt f ] (7) 
k 

Writing | • | 2 as a product of complex conjugates, we obtain: 

P T (y,x) = ^ r [(^(y|^)(^|x))(^(^|y)(x|0 / ))e i ( A '=- A ') i ] (8) 

k i 

= E(yi^)(^i^))(E^iy)^i^))^T[e i(Afc - A;)t ] (9) 
k i 

Now by assumption, Et^ WT [e l ( Xk ~ x ^ t ] vanishes as T — > oo for all 7^ A;, so we have 

P T (y,x) - ^2{y\<f> k ){<l>k\x)( Yl {Mv){^k}) = E I E {y\<t>k){<t>k\x)\ 2 = U(y,x) (10) 
k i-.e l= e k j kec 3 

in the limit T — > 00. ■ 

It can be inferred from this lemma that most quantum walks converge to a distribution p other 
than the uniform stationary distribution ir = u, and that p is not even independent of the initial 
walk stateH There are exceptions to this rule, for example quantum walks with distinct eigenvalues 
on Cayley graphs of Abelian groups (as observed by Aharonov et al. [1]), but they are not likely 
to arise in MCMC applications where the Markov chains have little structure. How then are we to 
sample from u using quantizations of these Markov chains? Here is where decoherence helps. 

Theorem 3.2 (The limit T' — > 00) Let P be a symmetric, irreducible Markov chain and lot be a 
family of distributions satisfying Et<-u T [e l0t ] — > as T — > 00 for any 9^0. For T sufficiently large 
(but fixed), the T' -repeated quantum walk (U c t(P) , <jJt) generates a Markov chain {Pt) T approaching 
uv in the limit T' —> 00. 

5 For a walk on the symmetric group S n , Gerhardt and Watrous [15] showed that |||IT — > ^ — ^t( 2 "JTi 2 ); 

for a walk on the hypercube Z2, Moore and Russell [29] showed that there exists an e > such that ^ ||IT — 
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Proof: We need to show that for T sufficiently large, the Markov chain Pt is ergodic with uniform 
stationary distribution. 

That the uniform distribution is stationary is clear: each of the Pt(y,x) := \(y\e lPt \x}\ 2 has 
uniform stationary distribution since the uniform classical state is invariant under unitary quantum 
operations and under total measurement of the system; thus, any probabilistic combination Pt of 
them has uniform stationary distribution. 

To show that Pt is ergodic for all sufficiently large T, it is sufficient (by Lemma l3.ip to prove 
that LT is ergodic. (The latter implies the former because the ergodic matrices form an open subset 
of the set of stochastic matrices.) Why is II ergodic? Because the 1-eigenspace of P is precisely 
the space spanned by u, so it follows from Lemma 13. II (by consideration of only this nondegenerate 
eigenspace in the expression (jH))) that IL(y,x) > l/N 2 for every x,y. 

In fact, each of the Pt (and so Pt and LT as well) is symmetricjj To see this, write out the Taylor 
series for exp(iPt) and note that every positive integer power P k is symmetric (since P 2 (x,y) = 
~^2 Z P(x, z) ■ P(z,y) = P(y, z) ■ P(z,x) = P 2 (y,x)). This property will be quite useful in the 
next subsection: it will allow us to use Theorem 12. II to relate the spectral gap and the mixing time 
of P T . m 



3.2 Invariance of the mixing time 

Consider the quantum walks (Uct(P),p>T) an d {U c t(P),JtT) where P is a symmetric Markov chain. 
We show that these two quantum walks mix with essentially the same efficiency. The result extends 
beyond Ji,Jt to any pair of measurements u>,u>' which are sufficiently smooth or have nontrivial 
overlap as distributions. 

Let Pt and Pt be the Markov chains generated by the walks with measurement JL and /}, 
respectively, and let St '■= 1 — ||PtL x I|2 and 5t '■= 1 — ||PrL x ll2 be their respective spectral gaps. 

Lemma 3.3 (Spectral gap inequality) Let St and 5t be defined as above. Then for any k > 1 
we have the inequality: 

e- 1 5T<5 T <k(l-e- k )-5 k T + 2e- k (11) 

Proof: Suppose we want to simulate Pt by Pt- Scaling the distribution JIt by a := 1/e allows 
us to "fit it inside" the distribution JIt (he., e JIt < fir pointwise), so we can express JIt as the 
probabilistic combination olJxt + (1 — ot)v for some distribution u, so that 

P T = Et^fr [P t ] = aEt^ T [P t ] + (1 - a)E t ^ u [P t ] = aP T + (1 - a)Q (12) 

where Q is stochastic with uniform stationary distribution. It follows that 

\\Pt\ u A\2 < l/e||Pr| u x|| 2 + (1 - l/e)||Q| u x|| 2 (13) 

which implies that St > 1/e • 5t since ||Q| u x||2 < 1- 

Suppose we want to simulate Pt by PkT- Then the basic approach is the same, but since the 
support of [It is not compact we have to be careful. Scaling the distribution JIt by := 1/k allows 
us to fit it inside the distribution Jl^t up to the point t = kT, and the probability mass in Jlt past 
t = kT is only Pr t ^ T [£ > kT] = e~ k . So we can write 

fa = (1 - e ~ k ) ■ Ji\ eaA + e~ k ■ Jl 1 S U (14) 
6 II is also positive semidefinite: it is the Gram matrix of {f s } with f s (kl) :— {s\(f>k}{4>i\ s } if ^fe — otherwise. 
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where jl^ ad and /i^*' are the conditional distributions of \xt such that t < kT and t > kT, 
respectively; thus, 

P T = (1- e~ k ) ■ P£ ead + e~ k ■ Pft il (15) 

where P£ ead and PJj? il are the expectations of P t with respect to fi T ead and jitp tl , respectively. Since 
we can fit fi T ead inside p,^ if we scale it by 1/k, we can write 

PkT = \P¥ ad + ^-\)Q (16) 
where Q is stochastic with uniform stationary distribution. The above equations yield: 

P kT = ^(^-e-^ ) + ( i-l)g 

p T ~ ,*~\, P? il + (1 " \)Q (17) 



k(l-e~ k ) k(l-e~ k ) 1 y k 
From the triangle inequality, we obtain 

IfeUh < k{l ' e - k) \\PT\ u ,\\2 + ^l^ PTX-h + (1 " \)\\Q\ U A\2 (18) 

and, rearranging terms and simplifying: 

-(1 - H^rU|| 2 ) - - n 2e * < 1 - ||At| u x|| 2 (19) 



k{l-e~ k y " J|B lw fc(l- e -^ 



Theorem 3.4 (Equivalence of measurements) Let P be a symmetric Markov chain. Then: 
(a) if the T' -repeated quantum walk (U c t(P), JIt) threshold-mixes, then the T' ■ O (log N) -repeated 
quantum walk (U c t(P),p>T) threshold-mixes; (b) if the T 1 -repeated quantum walk (U c t(P) , /2t) threshold- 
mixes, then the T' ■ O (logT' log N) -repeated quantum walk (U c t(P), Ar-O(iogT')) threshold-mixes. 

Proof: To see (a), note that our assumption implies that Pt mixes in time T". Therefore, 5t = 
Q(l/T') by Theorem I2.1|, and from Lemma 13.31 it follows that 5t = 0(1/T"). Applying Theorem 
12.11 again, we obtain for Pt a mixing time of 0(T' log N). 

The proof of (b) is almost as straightforward. Our assumption implies that Pt mixes in time 
T", so 5t = Q(l/T') by Theorem 12.11 Set k to be the smallest integer for which St > 3e~ fc ; in 
particular, k = 0(log5^ 1 ) = O(logT'). By Lemma [3731 

>- ijr^j* - 2 ^> * wh^r k) = e( = e( W ; (20) 

Applying Theorem 12.11 again . we obtain for a mixing time of 0(T' log T' log TV). ■ 

It should be readily apparent that this equivalence holds for any two measurement rules with 
finite expectation and significant overlap for most T. We also remark that although the above 
lemma and theorem are stated in terms of quantum walks, the proofs indicate that they are merely 
statements about an abstract game involving a collection of symmetric Markov chains {Pt}t>o 
and a T-parametrized family of probability measures {wy}, where we seek to minimize the "cost 
function" T • T' . 
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4 Quantum speedup for periodic lattices 



4.1 A near-diameter upper bound 

The classical random walk on the periodic lattice Z^ (with N = n d vertices and diameter \n/2\d) 
has uniform stationary distribution tt = u and spectral gap 5 = 0(min{^, -?}). It threshold- 
mixes in time @(n 2 dlogd), which is 0(V <5 _1 log l/V*) only when is quite high-dimensional: in 
particular, when d is roughly of order n 4 or larger. We show that a few repetitions of the continuous- 
time quantum walk can bring this down to 0(ndlogd), which is 0(V 5~ 1 log l/ir*) for any d > 1 
and n > 2 and asymptotically no worse than the diameter of Z^ up to at most a logarithmic factor. 
First we prove a lemma governing mixing of various decoherent quantum walks on the cycle Z^ . 

Lemma 4.1 (Mixing on cycles) Let Z n be the cycle on n > 2 vertices. The continuous-time 
walks (C/ c t(P(Z n )), u>t) with measurement uj G {<5, p., ft,} threshold-mix for any T G T := [| • ^, 
and t/ie Hadamard walks (C/// ac ;(Z n ), wp) mi/i measurement uj G \v,v} threshold-mix for any T G 

^ [2 n n 1 

Proof: Consider first the continuous-time walk. To prove that it threshold-mixes with any of the 
measurements uj G {5, /Z, fx} for any T £ I, it suffices by Proposition 12.21 to show that for every 
t G 1' := [§ • § , | • f ], d(Pi) is bounded below one by a positive constant, where Pt is the Markov 
chain generated with measurement uj = 5. (Indeed, this easily implies that c?(Pr) and d{Px) 
are bounded below one by a smaller positive constant, where Pt and Pt are the Markov chains 
generated with measurement u = ft and to = p,, respectively.) 

Let \<f>t) and \ipt) be the wavefunctions at time t for the continuous-time walks on Z and Z n , re- 
spectively, starting from the origin (without loss of generality, since Z and Z n are vertex-transitive). 
Then for each y G Z n we have: 

m)= £ <y|&> (21) 

j/=j; mod n 

Childs [10] shows that (y\4>t) = (— i) y Jy(t)i where J y is a Bessel function of the first kind. In 
particular, for \y\ 3> 1 the quantity |</ y (£)| is (a) exponentially small in \y\ for t < (1 — e) • \y\ and 
(b) of order |y| -1 / 2 for t > (1 + e) ■ \y\. For every t < | • §, property (a) implies that the only term in 
the above summand that is non-negligible is the (y\4>t) with \y\ < ^ (call it y and note that y <-> y 
is a 1-1 correspondence), so we can use property (b) to conclude that up to a negligible correction 

IWt)l^l(y|^)l = e(i/v^) (22) 

for every \y\ ^> 1 and t > (1 + e) • \y\. In particular, the nearly |n different y with 1 <C |y| < | • § 
satisfy KylV't)! = ^(Vv^)) an d therefore Pt(y,0) = J7(l/n), for every t G X'. So by Proposition 
12.31 d(Pt) is bounded below one by a positive constant. 

For the Hadamard walk, the wavefunction is no longer characterized by Bessel functions, but 
it retains the same essential asymptotic spreading behavior as its continuous-time counterpart (see 
Nayak et al. [30^ [7]), and the argument above works with little modification. A caveat is the 
emergence of a parity problem: if n is even, then Z n is bipartite and the wavefunction is supported 
only on vertices of the same parity at each integer timestep. Hence the Hadamard walk with u = 5 
threshold-mixes only on vertices of the same parity, but with time-averaged measurement uj = v or 
to = v parity is broken and threshold-mixing occurs on the entire vertex set. 
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Although the argument above relies on the asymptotic behavior of the wavefunction as n — > oo, 
this is clearly the difficult case: if n is bounded, then it suffices to show only that there exists a time 
(or a pair of consecutive timesteps, in the case of the Hadamard walk) in which the wavefunction 
is supported on at least 2/3 of the vertices. ■ 

For the Hadamard walk with measurement uj = v % Lemma 14.11 resolves a conjecture of Kendon 
and Tregenna [23] based on numerical experiments. 

Let G d denote the dth (Cartesian) power of a graph G. Examples are the d-dimensional standard 
lattice (the dth power of a line) and the d-dimensional periodic lattice (the dth power of a cycle). 
The following theorem shows how to extend a threshold-mixing result from G to G d . 

Theorem 4.2 (Mixing on graph powers) Suppose the continuous- time quantum walk (U c t(P(G)), 8t) 
threshold-mixes. Then the O (log d) -repeated walk (U c t(P(G d )),5Td) threshold-mixes. 



Proof: The Hamiltonian H' = P(G d ) is related to the Hamiltonian H = P(G) by the identity: 

d 



d 

H' = - ® H <g) I® {d - j) (23) 

i=i 

Since H' commutes with the identity /, which can introduce at most a global phase factor to the 
system, the Markov chain P[ generated by the walk (U c t(P(G d )),St) is the dth tensor power of the 
Markov chain P t j d generated by the walk (Uct(P(G)), 5t/d)- By assumption, d(Px) < a for some 
constant a < 1. By Proposition 12.21 we can choose T' = O(logd) to ensure that: 

l\\(P T f - ul% < d((P T f )<^L (24) 

Then at least n^/2/3 entries in each column of (Pt) T ' are bounded below by 1 l ' 2d , otherwise we 
would have the contradiction 

\\\{Pt) T ' - ul% = 1 - J2 mm{(P T ) T '(y,x), 1} > (1 - ^3)^ > ± ( 25 ) 



where the first equation uses the identity ^| \p — q\ |i = 1 — J2 k mm{pk, q^} for distributions p, q and 
the last inequality uses simple algebra along with the fact that for any d > 1: 

U-^>f (26) 

Since (P^) T ' = ((-Pr) T ')® d , at least (n0j3) d = |n d of the entries in each column of (-Py d ) T ' are 
bounded below by (——^—) d > ^j. It follows from Proposition 12.31 that {P^d) 7 " threshold-mixes 
in time 0(1). ■ 

We have the following corollary for the dth power of the cycle ^ n . 

Corollary 4.3 (Mixing on periodic lattices) Let Z, d be the d-dimensional periodic lattice with 
n > 2 vertices per side. The O (log d) -repeated continuous-time quantum walk (?7 c t(P(Z„)), u) n d/2) 
with measurement uo E {5, £L, fl} threshold-mixes. 
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Proof: Combining Lemma [4.11 with Theorem 14.21 we conclude that (£/c{(P(Z„)), St) threshold- 
mixes for any T £ [| • ^]. It is easy to see (cf. Lemma 14. If) that this is sufficient to imply the 
stated corollary not only for the measurement u = 5 but also for the time-averaged measurements 
£l and fx. ■ 

For d > 1, this extends the results of Fedichkin et al. [141 [32l 133] by confirming 0(n) and 
0(dlog(i) scaling (suggested by their analytical estimates and numerical experiments in regimes of 
both high and low decoherence) of the fastest-mixing walk, which they conjectured to be decoherent 
rather than unitary. 



4.2 The Grover walk 

An important question is whether there is a T'-repeated Grover walk ([/^ (-P(Z^)), u>t) that threshold- 
mixes for T = 0(nd) and T' = 0(log d). Szegedy [33] showed that the phase gap (minimum nonzero 
eigenvalue phase from [— tt, tc] in absolute value) of Udt{P) is 0,("/S) and exploited this property to 
prove a quadratic quantum speedup for the hitting time of any symmetric Markov chain P. A nat- 
ural adaptation of his argument to the mixing time setting would be something like the following: 
since the phase gap of Udt(P) is 9 = £l(VS), we expect to see by decomposing the action of Udt{P) 
along spectral components that roughly 0{\/9) = 0(V S^ 1 ) timesteps suffice for the orbit of any 
initial basis state to "cover" the entire state space with sufficient amplitude. 

Unfortunately, this argument is incorrect: in fact, the orbit may remain quite localized around 
the initial basis state. This happens with dramatic effect to the Grover walk on the complete 
graph G = Kjy, which mixes in time T 1 = @(N) (cf. [31]) even though the classical random walk 
on Kn mixes in a single timestep. It also happens to the Grover walk on Z^ for d = 2, albeit 
less dramatically \L7\ [35] 125] The probability distribution pt induced on the vertices of Z^ at 
time t < n/2 is primarily localized at the initial basis state [T7] but has substantial secondary 
spikes which propogate across the lattice in orthogonal directions [25] , In particular, the standard 
deviation of pt appears to grow linearly with t [25] and the mixing time of the Grover walk on Z^ 
with measurement u> = v appears to be T = O(n). In the high-dimensional regime, Moore and 
Russell [29] proved that the Grover walk on Zr[ with measurement uj = 5 mixes almost perfectly 
in time T = O(d). It seems plausible that the decoherent Grover walk on Z^ with measurement 
uj = v mixes as fast asymptotically as the continuous-time walk. 



5 Conclusion and open problems 

We have shown that decoherent quantum walks have the potential to speed up a large class of 
classical MCMC mixing processes. Exactly how large this class is, and whether a generic quantum 
mixing speedup to 0(V <5 -1 log 1/tt*) is possible, remain important open questions. Since it seems 
that quantum walks can outperform classical random walks in low-dimensional examples and un- 
derperform in very high-dimensional examples (such as the complete graph), a hybrid method may 
work best for generic Markov chains consisting of both low- and high-dimensional substructures. 
Also worth investigating is whether by randomizing the "coin" used in discrete-time quantum walks 
we can improve mixing on the complete graph and other adversarial examples. 

7 For d — 1 it does not; the Grover walk on Z n with measurement u = v mixes perfectly in T = n timesteps. 
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